Abstract

In [8] the large deviations limit $\lim_{N\to\infty}(Na)^{-1}\log\mathbb{E}Z_N^a$ for the moments of the
partition function $Z_N$ in the Sherrington-Kirkpatrick model [5] was computed for all
real $a \ge 0$. For $a \ge 1$ this result extends the classical physicist's replica method that
corresponds to integer $a$. We give a new proof for $a \ge 1$ in the case of the pure $p$-spin
SK model that provides a strong exponential control of the overlap.

1 Introduction and main results.

For integer $N \ge 1$ we consider $\Sigma_N = \{-1,+1\}^N$ and a Gaussian Hamiltonian (process)
$H_N(\sigma)$ indexed by $\sigma \in \Sigma_N$. We assume that its covariance satisfies

$$\frac{1}{N}\mathbb{E}H_N(\sigma^1)H_N(\sigma^2) = \xi(R_{1,2}), \tag{1.1}$$

where

$$R_{1,2} = \frac{1}{N}\sum_{i\le N}\sigma_i^1\sigma_i^2 \tag{1.2}$$

is the overlap of configurations $\sigma^1, \sigma^2$ and $\xi$ is a smooth function such that

$$\xi(0) = 0, \quad \xi(x) = \xi(-x), \quad \xi''(x) > 0 \ \text{ if } x > 0. \tag{1.3}$$

In fact, our main results will be obtained in the case when $\xi(x) = \beta^2|x|^p$ for $p \ge 2$. When
$p \ge 2$ is an even integer, the corresponding Hamiltonian

$$H_N(\sigma) = \frac{\beta}{N^{(p-1)/2}}\sum_{1\le i_1,\dots,i_p\le N} g_{i_1,\dots,i_p}\,\sigma_{i_1}\cdots\sigma_{i_p}$$

is called a pure $p$-spin SK Hamiltonian. Here $(g_{i_1,\dots,i_p})$ are i.i.d. standard Gaussian random
variables. Let us define a function $\theta(x) = x\xi'(x) - \xi(x)$ that due to convexity assumption on
$\xi$ satisfies

$$\Delta(a,b) := \xi(a) - a\,\xi'(b) + \theta(b) \ge 0 \quad \text{for all } a, b \in \mathbb{R}. \tag{1.4}$$
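
The inequality (1.4) is the tangent-line inequality for the convex function $\xi$. The short derivation below is a sketch that uses only the definition $\theta(x) = x\xi'(x) - \xi(x)$ and the convexity that follows from (1.3); the last line specializes to the pure $p$-spin case.

```latex
\begin{align*}
\Delta(a,b) &= \xi(a) - a\,\xi'(b) + b\,\xi'(b) - \xi(b) \\
            &= \xi(a) - \xi(b) - (a-b)\,\xi'(b) \;\ge\; 0
            && \text{(a convex function lies above its tangent at } b\text{)}, \\
\Delta(b,b) &= 0, \\
\theta(x)   &= x\cdot p\beta^2|x|^{p-1}\operatorname{sgn}(x) - \beta^2|x|^p
             = \beta^2(p-1)|x|^p
            && \text{for } \xi(x)=\beta^2|x|^p .
\end{align*}
```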

This property will be crucial in the same way it was crucial in all recent progress in the SK 
model, because it yields the positivity of error terms in Guerra type interpolations and, as 
a result, allows us to control them. Given the external field parameter $h \in \mathbb{R}$, we define the
partition function by

$$Z_N = \sum_{\sigma\in\Sigma_N}\exp\Bigl(H_N(\sigma) + h\sum_{i\le N}\sigma_i\Bigr). \tag{1.5}$$

The main results of our paper are motivated by a problem considered in [8]. The goal of that 
paper was to identify the following large deviations limit for the partition function 

$$V(a) = \lim_{N\to\infty}\frac{1}{Na}\log\mathbb{E}Z_N^a$$

for all $a \ge 0$. It was shown that the cases $0 \le a \le 1$ and $a \ge 1$ are very different. The first
case is of the same nature as the Parisi formula, proved rigorously in [7], that corresponds
to $a = 0$. The second case $a \ge 1$ is the generalization of the so called replica method that
corresponds to integer $a \ge 1$. In our paper we will only be interested in this second case and,
in particular, the following problem. The main result for $a \ge 1$ in [8], Theorem 9.4, says that
if $a \ge 1$ then

$$V(a) = \max_{q\in[0,1]}\mathrm{RS}(q) \tag{1.6}$$

where

$$\mathrm{RS}(q) = \log 2 + \frac12\bigl(\xi(1) - \xi'(q) + (1-a)\theta(q)\bigr) + \frac1a\log\mathbb{E}\,\mathrm{ch}^a(z+h) \tag{1.7}$$

and where a r.v. $z$ has normal distribution $N(0,\xi'(q))$. (It is interesting to note that RS
is convex in a by main result in [4] which, otherwise, is not at all obvious.) The proof of 
(1.6) was based on a beautiful convexity argument in the spirit of the proof of Ghirlanda-Guerra
identities [1]. The proof also suggested that, assuming that the supremum of $\mathrm{RS}(q)$
is achieved at a unique point $q_0$, the distribution of the overlap under a certain change of
density should be concentrated near $q_0$ if the external field $h \ne 0$ or, by symmetry, near
$\pm q_0$ if $h = 0$. This type of behavior was witnessed in a weak sense - on average over some
small perturbations of the Hamiltonian $H_N(\sigma)$. In the present paper we achieve a strong
exponential control of the overlap for pure $p$-spin case, $\xi(x) = \beta^2|x|^p$ for $p \ge 2$.

In order to formulate and motivate the results let us sketch the starting point of the 
proof of (1.6) in [8] which gives the lower bound $V(a) \ge \sup_q \mathrm{RS}(q)$. This is based on Guerra's
interpolation. Let us consider an interpolating Hamiltonian for $0 \le t \le 1$ given by

$$H_t(\sigma) = \sqrt{t}\,H_N(\sigma) + \sqrt{1-t}\sum_{i\le N} z_i\sigma_i + h\sum_{i\le N}\sigma_i \tag{1.8}$$

where $(z_i)_{i\le N}$ are i.i.d. $N(0,\xi'(q))$. Define the partition function as above,
$Z_t = \sum_{\sigma\in\Sigma_N}\exp H_t(\sigma)$, and let $\varphi(t) = \frac{1}{Na}\log\mathbb{E}Z_t^a$.

A standard Gaussian integration by parts then shows that

$$\varphi'(t) = \frac12\bigl(\xi(1) - \xi'(q) + (1-a)\theta(q)\bigr) + \frac12(a-1)\,\mathbb{E}'\langle\Delta(R_{1,2},q)\rangle$$

where $\langle\cdot\rangle$ is the Gibbs average with Hamiltonian $H_t(\sigma)$ (we keep its dependence on $t$ implicit)
and $\mathbb{E}'$ denotes the expectation with the following change of density

$$\forall f, \quad \mathbb{E}'f = \mathbb{E}\,Tf \quad \text{for} \quad T = \frac{Z_t^a}{\mathbb{E}Z_t^a}.$$



Since

$$\varphi(0) = \log 2 + \frac1a\log\mathbb{E}\,\mathrm{ch}^a(z+h)$$

we get

$$\varphi(1) = \mathrm{RS}(q) + \frac12(a-1)\int_0^1\mathbb{E}'\langle\Delta(R_{1,2},q)\rangle\,dt. \tag{1.9}$$

By (1.4), $\Delta \ge 0$ and we get $V(a) \ge \mathrm{RS}(q)$. In order to show that this lower bound is
sharp for $q = q_0$ we need to show that the last integral in (1.9) is small. The strong control
of $\Delta(R_{1,2},q_0)$ along the interpolation $0 \le t \le 1$ was obtained in [8] for $1 \le a \le 2$ but,
as we mentioned above, the matching upper bound for general $a \ge 1$ was given only in a
weak sense. Of course, the fact that we have a matching upper bound implies that the above
integral is small and, thus,

$$\lim_{N\to\infty}\int_0^1\mathbb{E}'\langle\Delta(R_{1,2},q_0)\rangle\,dt = 0. \tag{1.10}$$

By itself this does not exclude large values of $\mathbb{E}'\langle\Delta(R_{1,2},q_0)\rangle$ at some exceptional points $t$
but our first simple observation does.

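Theorem 1 For all $a \ge 2$ and $0 \le t < 1$ we have

$$\lim_{N\to\infty}\mathbb{E}'\langle\Delta(R_{1,2},q_0)\rangle = 0.$$

Proof. A simple computation by Gaussian integration by parts gives (we omit $q_0$ in $\Delta$ for
simplicity of notations)

$$\begin{aligned}
\frac{1}{N}\frac{\partial}{\partial t}\mathbb{E}'\langle\Delta(R_{1,2})\rangle
 = {}& \mathbb{E}'\langle\Delta(R_{1,2})^2\rangle + 2(a-2)\,\mathbb{E}'\langle\Delta(R_{1,2})\Delta(R_{1,3})\rangle \\
 & + \frac12(a-2)(a-3)\,\mathbb{E}'\langle\Delta(R_{1,2})\Delta(R_{3,4})\rangle - \frac12 a(a-1)\bigl(\mathbb{E}'\langle\Delta(R_{1,2})\rangle\bigr)^2.
\end{aligned} \tag{1.11}$$

Hölder's inequality applied either to $\langle\cdot\rangle$ or $\mathbb{E}'$ implies

$$\mathbb{E}'\langle\Delta(R_{1,2})^2\rangle \ge \mathbb{E}'\langle\Delta(R_{1,2})\Delta(R_{1,3})\rangle \ge \mathbb{E}'\langle\Delta(R_{1,2})\Delta(R_{3,4})\rangle \ge \bigl(\mathbb{E}'\langle\Delta(R_{1,2})\rangle\bigr)^2.$$

Since the sum of all coefficients on the right hand side of (1.11) is zero and the first two
are nonnegative for $a \ge 2$, the above derivative is nonnegative so that $\mathbb{E}'\langle\Delta(R_{1,2})\rangle$ is
nondecreasing in $t$. Together with (1.10) this proves the result.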

□ 
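
The sign argument at the end of the proof above can be made explicit by summation by parts. The following is a sketch in which $A \ge B \ge C \ge D$ abbreviate (our shorthand, not the paper's) the four quantities in the Hölder chain, in the order they appear.

```latex
% A = E'<\Delta(R_{1,2})^2>,  B = E'<\Delta(R_{1,2})\Delta(R_{1,3})>,
% C = E'<\Delta(R_{1,2})\Delta(R_{3,4})>,  D = (E'<\Delta(R_{1,2})>)^2
\begin{align*}
c_1 &= 1, \quad c_2 = 2(a-2), \quad c_3 = \tfrac12(a-2)(a-3), \quad
c_4 = -\tfrac12 a(a-1), \qquad c_1+c_2+c_3+c_4 = 0, \\
c_1A + c_2B + c_3C + c_4D
 &= c_1(A-B) + (c_1+c_2)(B-C) + (c_1+c_2+c_3)(C-D) \\
 &= (A-B) + (2a-3)(B-C) + \tfrac12 a(a-1)(C-D) \;\ge\; 0
 \qquad \text{for } a \ge 2 .
\end{align*}
```

Only the partial sums of the coefficients need to be nonnegative, so the possibly negative coefficient $c_3$ for $2 < a < 3$ causes no difficulty.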

This, however, does not give us control of the overlap at $t = 1$ and even for $t < 1$ it
is still a rather weak statement. For example, this does not show that the third moment
$|R_{1,2} - q_0|^3$ is of a smaller order than the second moment which is needed in order to carry
out second moment computations and prove central limit theorem for the overlap. Below we 
will formulate a stronger statement. 

Let us start with a remark about maxima of $\mathrm{RS}(q)$. It is easy to check that the critical
point condition is

$$\frac{\mathbb{E}\,\mathrm{ch}^a(z+h)\,\mathrm{th}^2(z+h)}{\mathbb{E}\,\mathrm{ch}^a(z+h)} = q \quad \text{where } z \sim N(0,\xi'(q)). \tag{1.12}$$

We do not know how to prove that this equation has a unique solution, even though numerical
observations show that this seems to be the case. For $a = 0$, Lemma 2.4.8 in [6] (Guerra-Latala)
proves that such solution is unique by showing that a function $q \mapsto \mathbb{E}\,\mathrm{th}^2(z+h)/q$ is
decreasing. This, however, is not always true for $a \ge 1$.
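
For the reader's convenience, here is a sketch of how (1.12) arises, assuming the form of $\mathrm{RS}$ in (1.7) and writing $z = g\sqrt{\xi'(q)}$ for a standard Gaussian $g$ (this parametrization is our choice). It uses $\theta'(q) = q\,\xi''(q)$ and the Gaussian identity $\frac{d}{dv}\mathbb{E}f(g\sqrt{v}) = \frac12\mathbb{E}f''(g\sqrt{v})$.

```latex
\begin{align*}
f(x) &= \mathrm{ch}^a(x+h), \qquad
f''(x) = a\,\mathrm{ch}^a(x+h)\bigl(1 + (a-1)\,\mathrm{th}^2(x+h)\bigr), \\
\frac{\partial}{\partial q}\,\frac1a\log\mathbb{E}\,\mathrm{ch}^a(z+h)
 &= \frac{\xi''(q)}{2}\Bigl(1 + (a-1)\,
    \frac{\mathbb{E}\,\mathrm{ch}^a(z+h)\,\mathrm{th}^2(z+h)}{\mathbb{E}\,\mathrm{ch}^a(z+h)}\Bigr), \\
\frac{\partial}{\partial q}\,\frac12\bigl(\xi(1)-\xi'(q)+(1-a)\theta(q)\bigr)
 &= \frac{\xi''(q)}{2}\bigl(-1 + (1-a)q\bigr), \\
\mathrm{RS}'(q)
 &= \frac{(a-1)\,\xi''(q)}{2}\Bigl(
    \frac{\mathbb{E}\,\mathrm{ch}^a(z+h)\,\mathrm{th}^2(z+h)}{\mathbb{E}\,\mathrm{ch}^a(z+h)} - q\Bigr).
\end{align*}
```

For $a \ne 1$ and $\xi''(q) > 0$ the derivative vanishes exactly when (1.12) holds.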

Below we will prove that the overlap essentially can not take values $u \in [-1,1]$ such
that $\mathrm{RS}(|u|) < \max_q \mathrm{RS}(q)$. Under an additional (easy-to-check) assumption that $\mathrm{RS}(q)$ has
a unique maximum at $q_0$ this will imply that the overlap is strongly concentrated near $q_0$.
By now standard techniques (for example, Sections 2.6, 2.7 in [6]) one can then carry out 
second moment computations and prove central limit theorem for the overlap. 

Let us note that the overlap can only take values $R_{1,2} = k/N$ for integer $-N \le k \le N$.
For simplicity of notations throughout the paper when we write $R_{1,2} = u \in [-1,1]$ we mean
that $R_{1,2} = u_N$ for some sequence $(u_N)$ such that $\lim_{N\to\infty}u_N = u$. We would like to show
that (for $t = 1$)

$$\mathbb{E}'\langle I(R_{1,2} = u)\rangle \le L\exp(-N/L) \tag{1.13}$$

if one of the following holds:

1. $h \ne 0$ and either $u < 0$ or $u \ge 0$ and $\mathrm{RS}(u) < V(a)$.

2. $h = 0$ and $\mathrm{RS}(|u|) < V(a)$.

Here and everywhere below L denotes a constant that does not depend on N. We need to 
separate these two cases because without external field, $h = 0$, the distribution of overlap is
symmetric. In order to prove (1.13), we will prove a stronger statement. For $n \ge 1$ let

$$C_n = \bigl\{U = (u_{l,l'})_{1\le l,l'\le n} : U^T = U \ge 0,\ u_{l,l'} \in [-1,1] \text{ and } u_{l,l} = 1\bigr\}.$$

Given $U \in C_n$, we will write $\{R_{l,l'} = u_{l,l'}\}$ to denote a set of all spin configurations $\sigma^1,\dots,\sigma^n$
such that $R_{l,l'} = u_{l,l'}$ for all $1 \le l,l' \le n$. In fact, since it will always be absolutely clear
from the context, we will abuse the notations and simply write $U$ to denote $\{R_{l,l'} = u_{l,l'}\}$.
We define a product partition function with corresponding constraints on overlaps by

$$Z_n(U) = \sum_{U}\exp\Bigl(\sum_{l\le n}H_N(\sigma^l) + h\sum_{l\le n}\sum_{i\le N}\sigma_i^l\Bigr). \tag{1.14}$$

Also, for $n = 2$ and $u \in [-1,1]$ we will write $Z_2(u)$ instead of $Z_2(U)$ for a matrix $U \in C_2$
such that $u_{1,2} = u$. Let $n$ be such that

$$n \le a \le n+1. \tag{1.15}$$

Then we can write 

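$$\mathbb{E}'\langle I(R_{1,2} = u)\rangle = \frac{\mathbb{E}\,Z_N^{a-2}Z_2(u)}{\mathbb{E}Z_N^a} \le \frac{\mathbb{E}\bigl(Z_N^{n-1}Z_2(u)\bigr)^{a/(n+1)}}{\mathbb{E}Z_N^a}$$

since the last inequality (for the integrands) is equivalent to $Z_2(u) \le Z_N^2$. Therefore, (1.13)
will follow from

$$\frac{1}{Na}\log\mathbb{E}\bigl(Z_N^{n-1}Z_2(u)\bigr)^{a/(n+1)} \le \frac{1}{Na}\log\mathbb{E}Z_N^a - \frac1L.$$

Since each overlap takes at most $2N+1$ values it should be obvious that the left hand side
is equivalent for $N \to \infty$ to

$$\sup_W \frac{1}{Na}\log\mathbb{E}Z_{n+1}(W)^{a/(n+1)}$$

where the supremum is taken over all $W \in C_{n+1}$ such that $w_{1,2} = u$. Thus, for pure $p$-spin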
model exponential overlap control in (1.13) follows from our main result. 
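
The claimed equivalence with $Z_2(u) \le Z_N^2$ is a matter of comparing exponents. A sketch, assuming $a \le n+1$ as in (1.15) and abbreviating $r = a/(n+1) \le 1$ (our notation):

```latex
\begin{align*}
Z_N^{a-2}\,Z_2(u) \le \bigl(Z_N^{\,n-1}Z_2(u)\bigr)^{r}
 &\iff Z_2(u)^{1-r} \le Z_N^{\,r(n-1)-a+2} = Z_N^{\,2(1-r)}
 \qquad (\text{since } r(n-1) - a = -2r) \\
 &\iff Z_2(u) \le Z_N^{2}
 \qquad (\text{when } r < 1;\ \text{for } r = 1 \text{ both sides coincide}), \\
Z_N^{\,n-1}Z_2(u) &= \sum_{W \in C_{n+1}:\ w_{1,2}=u} Z_{n+1}(W).
\end{align*}
```

The last sum has at most $(2N+1)^{n(n+1)/2}$ nonzero terms, a number polynomial in $N$, which is why passing to the supremum over $W$ costs nothing on the exponential scale.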

Theorem 2 Suppose that $\xi(x) = \beta^2|x|^p$ for $p \ge 2$ and $u \in [-1,1]$ satisfies one of the
conditions in (1.13). If $W \in C_{n+1}$ is such that $w_{1,2} = u$ then

$$\lim_{N\to\infty}\frac{1}{Na}\log\mathbb{E}Z_{n+1}(W)^{a/(n+1)} \le V(a) - \frac1L \tag{1.16}$$

for some L > 0. 

The only part of the proof that uses the specific choice of $\xi(x) = \beta^2|x|^p$ is Lemma 3 below.
Generalizing Lemma 3 would immediately yield Theorem 2 for other choices of £, for example, 
mixed p-spin Hamiltonians. 

Let us mention that the control of the overlap for pure $p$-spin model provided by Theorem 2
generalizes the so called replica method which corresponds to integer $a = n$. For
completeness let us formulate this well known result. 

Theorem 3 Suppose that $a$ is integer, $\xi$ satisfies (1.3) and $u \in [-1,1]$ satisfies one of the
conditions in (1.13). Then (1.13) holds. 

The proof of this result follows by well known techniques, see Section 2.15 in [6] or [9]. 
The result is proved in the Appendix A of [9] only for $h = 0$ (the authors attribute the proof
to Elliott Lieb) but essentially the same argument can be extended to $h \ne 0$. For additional
comments see remark following Lemma 2 below.

2 Proof of Theorem 2.



First of all, let us note that we write a limit on the left hand side of (1.16) instead of limsup 
because the limit exists. It follows from a standard superadditivity argument by using Guerra-Toninelli
interpolation as in [2]. We only mention that a condition $a/(n+1) \le 1$ is important
because it implies that a derivative in Guerra-Toninelli interpolation has a correct sign. 
Suppose that all conditions of theorem are satisfied but

$$\lim_{N\to\infty}\frac{1}{Na}\log\mathbb{E}Z_{n+1}(W)^{a/(n+1)} = V(a). \tag{2.1}$$

It will be convenient to assume that one of the elements in the last column of $W$ instead of
$w_{1,2}$ is equal to $u$. Suppose that

$$W = \begin{pmatrix} U & \mathbf{u} \\ \mathbf{u}^T & 1 \end{pmatrix} \tag{2.2}$$

where $U \in C_n$, $\mathbf{u} = (u_1,\dots,u_n)^T$ and one of the coordinates of $\mathbf{u}$ is equal to $u$. The inequality
$Z_n(U)^{a/n} \le Z_N^a$ together with (2.1) implies that

$$\limsup_{N\to\infty}\frac{n}{Na}\log\mathbb{E}Z_n(U)^{a/n} \le nV(a) = \lim_{N\to\infty}\frac{n}{Na}\log\mathbb{E}Z_{n+1}(W)^{a/(n+1)}. \tag{2.3}$$

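We will first obtain a lower bound for the left hand side of (2.3). Given a vector $\mathbf{z} \in \mathbb{R}^n$ and
$\lambda = (\lambda_{l,l'})_{1\le l<l'\le n}$ we define a function

$$\Phi_n(\mathbf{z},\lambda) = \sum_{\varepsilon_1,\dots,\varepsilon_n=\pm1}\exp\Bigl(\sum_{l\le n}\varepsilon_l(z_l + h) + \sum_{l<l'}\lambda_{l,l'}\varepsilon_l\varepsilon_{l'}\Bigr). \tag{2.4}$$

Given an $n \times n$ covariance matrix $Q$ let $\xi'(Q) = (\xi'(q_{l,l'}))$ and define

$$\begin{aligned}
\psi(Q,\lambda) = {}& \frac12\sum_{1\le l,l'\le n}\Bigl(\xi(u_{l,l'}) - u_{l,l'}\,\xi'(q_{l,l'}) + \Bigl(1-\frac{a}{n}\Bigr)\theta(q_{l,l'})\Bigr) \\
 & - \sum_{l<l'}\lambda_{l,l'}u_{l,l'} + \frac{n}{a}\log\mathbb{E}\,\Phi_n^{a/n}(\mathbf{z},\lambda)
\end{aligned} \tag{2.5}$$

where $\mathbf{z}$ has Gaussian distribution $N(0,\xi'(Q))$.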

Lemma 1 If $n \le a$ then

$$\liminf_{N\to\infty}\frac{n}{Na}\log\mathbb{E}Z_n^{a/n}(U) \ge \sup_Q\inf_\lambda\psi(Q,\lambda). \tag{2.6}$$

Proof. Let us define $Z_t(U)$ by replacing $H_N(\sigma^l)$ in the definition of the partition function
$Z_n(U)$ with Hamiltonians

$$\sqrt{t}\,H_N(\sigma^l) + \sqrt{1-t}\sum_{i\le N}z_{i,l}\,\sigma_i^l$$

where $(z_{i,l})_{l\le n}$ are independent copies of $\mathbf{z}$ for $i \le N$ and let

$$\varphi(t) = \frac{n}{Na}\log\mathbb{E}Z_t^{a/n}(U).$$

By Gaussian integration by parts 

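$$\begin{aligned}
\varphi'(t) = {}& \frac12\sum_{1\le l,l'\le n}\Bigl(\xi(u_{l,l'}) - u_{l,l'}\,\xi'(q_{l,l'}) + \Bigl(1-\frac{a}{n}\Bigr)\theta(q_{l,l'})\Bigr) \\
 & + \frac12\Bigl(\frac{a}{n}-1\Bigr)\sum_{1\le l,l'\le n}\mathbb{E}'\langle\Delta(R_{l,l'},q_{l,l'})\rangle.
\end{aligned} \tag{2.7}$$

The Gibbs average $\langle\cdot\rangle$ in the last term is taken over two copies of the system and $R_{l,l'}$ denotes the
overlap between configuration $\sigma^l$ from the first copy and configuration $\rho^{l'}$ from the second
copy. Since $n \le a$ and $\Delta \ge 0$, the last term in (2.7) is nonnegative and, therefore,

$$\varphi(1) \ge \varphi(0) + \frac12\sum_{1\le l,l'\le n}\Bigl(\xi(u_{l,l'}) - u_{l,l'}\,\xi'(q_{l,l'}) + \Bigl(1-\frac{a}{n}\Bigr)\theta(q_{l,l'})\Bigr).$$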

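It should be obvious from definitions that for all $\lambda$

$$\varphi(0) \le -\sum_{l<l'}\lambda_{l,l'}u_{l,l'} + \frac{n}{a}\log\mathbb{E}\,\Phi_n^{a/n}(\mathbf{z},\lambda).$$

However, using standard large deviations techniques one can show that this bound is sharp

$$\lim_{N\to\infty}\varphi(0) = \inf_\lambda\Bigl(-\sum_{l<l'}\lambda_{l,l'}u_{l,l'} + \frac{n}{a}\log\mathbb{E}\,\Phi_n^{a/n}(\mathbf{z},\lambda)\Bigr).$$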

Since the choice of Q was arbitrary, this finishes the proof of Lemma. 

□ 
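
The upper bound on $\varphi(0)$ used in the proof above is an exponential tilting of the overlap constraints. A sketch, assuming the definitions (1.14) and (2.4), the normalization $\varphi(t) = \frac{n}{Na}\log\mathbb{E}Z_t^{a/n}(U)$, and writing $\mathbf{z}_i = (z_{i,l})_{l\le n}$ (our notation); at $t = 0$ the sites $i \le N$ decouple:

```latex
\begin{align*}
Z_0(U)
 &= \sum_{U}\exp\sum_{i\le N}\sum_{l\le n}\sigma_i^l(z_{i,l}+h) \\
 &\le \exp\Bigl(-N\sum_{l<l'}\lambda_{l,l'}u_{l,l'}\Bigr)
    \sum_{\sigma^1,\dots,\sigma^n}\exp\sum_{i\le N}\Bigl(\sum_{l\le n}\sigma_i^l(z_{i,l}+h)
      + \sum_{l<l'}\lambda_{l,l'}\sigma_i^l\sigma_i^{l'}\Bigr) \\
 &= \exp\Bigl(-N\sum_{l<l'}\lambda_{l,l'}u_{l,l'}\Bigr)\prod_{i\le N}\Phi_n(\mathbf{z}_i,\lambda), \\
\frac{n}{Na}\log\mathbb{E}Z_0(U)^{a/n}
 &\le -\sum_{l<l'}\lambda_{l,l'}u_{l,l'} + \frac{n}{a}\log\mathbb{E}\,\Phi_n^{a/n}(\mathbf{z},\lambda).
\end{align*}
```

The first inequality holds because on the constraint set $\sum_{i\le N}\sigma_i^l\sigma_i^{l'} = Nu_{l,l'}$, so the inserted factor equals one there, and the sum is then extended to all configurations.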

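If $a \le n+1$ then exactly the same proof will produce the upper bound for the right hand
side of (2.3) because the last term in (2.7), with $n$ now replaced by $n+1$, will be nonpositive.
Let $P$ be an $(n+1) \times (n+1)$ covariance matrix and let $\mathbf{y}$ be a Gaussian random vector with
distribution $N(0,\xi'(P))$. Given $\gamma = (\gamma_{l,l'})_{1\le l<l'\le n+1}$ we define

$$\begin{aligned}
\Psi(P,\gamma) = {}& \frac{n}{2(n+1)}\sum_{1\le l,l'\le n+1}\Bigl(\xi(w_{l,l'}) - w_{l,l'}\,\xi'(p_{l,l'}) + \Bigl(1-\frac{a}{n+1}\Bigr)\theta(p_{l,l'})\Bigr) \\
 & - \frac{n}{n+1}\sum_{1\le l<l'\le n+1}\gamma_{l,l'}w_{l,l'} + \frac{n}{a}\log\mathbb{E}\,\Phi_{n+1}^{a/(n+1)}(\mathbf{y},\gamma).
\end{aligned} \tag{2.8}$$

Lemma 2 If $a \le n+1$ then

$$\frac{n}{Na}\log\mathbb{E}Z_{n+1}(W)^{a/(n+1)} \le \inf_P\inf_\gamma\Psi(P,\gamma). \tag{2.9}$$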
iva p t 

Remark. This upper bound was given in Section 8 of [8] and the question answered here
(only for $p$-spin model) was posed as an open problem there. In order to prove Theorem 2,
the first urge is to try to find parameters $P$ and $\gamma$ that would witness (1.16). However, this
direct approach seems intractable. In fact, to understand the difficulty, one should look at
the simplest case of integer $a$ for which the answer is provided by Lieb's argument in [9].
Suppose that $a = n+1$. Then taking $P = W$ and $\gamma = 0$ in (2.9) yields

$$\begin{aligned}
\frac1N\log\mathbb{E}Z_a(W)
 &\le \log\sum_{\sigma}\exp\Bigl(\frac12\sum_{l,l'}\xi'(w_{l,l'})\sigma_l\sigma_{l'} + h\sum_{l\le a}\sigma_l\Bigr) - \frac12\sum_{l,l'}\theta(w_{l,l'}) \\
 &= \log\sum_{\sigma}\exp\Bigl(\sum_{l<l'}\xi'(w_{l,l'})\sigma_l\sigma_{l'} + h\sum_{l\le a}\sigma_l\Bigr) - \frac12\sum_{l,l'}\theta(w_{l,l'}) + \frac12\sum_{l\le a}\xi'(1).
\end{aligned}$$

An ingenious argument in [9] then shows that the supremum of the right hand side over $W$
is achieved on the diagonal when all $w_{l,l'} = w$ and is strictly less for $W$ off the diagonal. It is
easy to see that for a constant matrix $W$ the above bound becomes $a\,\mathrm{RS}(w)$ and, as a result,
we obtain exponential control for values of the overlap that do not maximize $\mathrm{RS}(q)$. We do
not see how to extend Lieb's argument for non-integer values of a, in particular, because the 
last term in (2.8) is much less explicit in P. Our approach will be quite different and the 
main idea will be to relate upper and lower bounds of Lemmas 1 and 2. It will follow from 
the argument below that, similarly to the integer case, the bound (2.9) is always maximized 
on the diagonal. 

□ 
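
The claim in the remark above that the bound becomes $a\,\mathrm{RS}(w)$ for a constant matrix can be checked directly. A sketch, assuming $w_{l,l'} = w \ge 0$ for $l \ne l'$, $w_{l,l} = 1$, integer $a$, and $z \sim N(0,\xi'(w))$:

```latex
\begin{align*}
-\frac12\sum_{l,l'}\theta(w_{l,l'}) + \frac12\sum_{l\le a}\xi'(1)
 &= -\frac a2\bigl(\xi'(1)-\xi(1)\bigr) - \frac{a(a-1)}{2}\theta(w) + \frac a2\xi'(1)
  = \frac a2\xi(1) - \frac{a(a-1)}{2}\theta(w), \\
\sum_{\sigma}\exp\Bigl(\xi'(w)\sum_{l<l'}\sigma_l\sigma_{l'} + h\sum_{l\le a}\sigma_l\Bigr)
 &= e^{-a\xi'(w)/2}\,\mathbb{E}\sum_{\sigma}\exp\sum_{l\le a}\sigma_l(z+h)
  = e^{-a\xi'(w)/2}\,\mathbb{E}\bigl(2\,\mathrm{ch}(z+h)\bigr)^a, \\
\text{total}
 &= a\log 2 + \frac a2\bigl(\xi(1)-\xi'(w)+(1-a)\theta(w)\bigr) + \log\mathbb{E}\,\mathrm{ch}^a(z+h)
  = a\,\mathrm{RS}(w).
\end{align*}
```

The middle line uses $\mathbb{E}\exp(zx) = \exp(\xi'(w)x^2/2)$ with $x = \sum_{l\le a}\sigma_l$, so that $x^2 = a + 2\sum_{l<l'}\sigma_l\sigma_{l'}$.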

Lemmas 1 and 2 and (2.3) imply that 

$$\sup_Q\inf_\lambda\psi(Q,\lambda) \le \inf_P\inf_\gamma\Psi(P,\gamma). \tag{2.10}$$

To prove Theorem 2 we need to extract useful information from comparing these upper and 
lower bounds. Let us start by rewriting the first line in (2.8). Recalling (2.2) and regrouping 
the terms we can write it as I + II + III where 



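$$\mathrm{I} = \frac{n}{2(n+1)}\sum_{1\le l,l'\le n}\Bigl(\xi(u_{l,l'}) - u_{l,l'}\,\xi'(p_{l,l'}) + \Bigl(1-\frac{a}{n}\Bigr)\theta(p_{l,l'})\Bigr),$$

$$\mathrm{II} = \frac{n}{2(n+1)}\bigl(\xi(1) - \xi'(p_{n+1,n+1}) + (1-a)\theta(p_{n+1,n+1})\bigr),$$

$$\mathrm{III} = \frac{a}{2(n+1)^2}\Bigl(\sum_{1\le l,l'\le n}\theta(p_{l,l'}) + n^2\theta(p_{n+1,n+1})\Bigr) + \frac{n}{n+1}\sum_{1\le l\le n}\Bigl(\xi(u_l) - u_l\,\xi'(p_{l,n+1}) + \Bigl(1-\frac{a}{n+1}\Bigr)\theta(p_{l,n+1})\Bigr).$$

Terms I and II were defined to match similar terms in the definition of $\psi$ and RS and III
ensures that the sum of all three gives the first line in (2.8). We would like to choose the
matrix $P$ such that $\mathrm{III} \le 0$. Unfortunately, at this point we were able to do this only in the
case when $\xi(x) = \beta^2|x|^p$ and this is the only part of the proof that uses the specific choice
of $\xi$ in Theorem 2.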
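
Lemma 3 Suppose that $\xi(x) = \beta^2|x|^p$. If we take

$$P = \begin{pmatrix} s^2\,\mathbf{u}\mathbf{u}^T & \mathbf{u} \\ \mathbf{u}^T & s^{-2} \end{pmatrix} \quad \text{where} \quad s^{-2} = |\mathbf{u}|_p := \Bigl(\frac1n\sum_{1\le l\le n}|u_l|^p\Bigr)^{1/p} \tag{2.11}$$

then $P^T = P \ge 0$ and $\mathrm{III} = 0$.

Proof. We can write $P = \mathbf{a}\mathbf{a}^T$ for $\mathbf{a} = (s\mathbf{u}^T, s^{-1})^T$ and, thus, $(Px,x) = (\mathbf{a}^Tx,\mathbf{a}^Tx) \ge 0$. Since
$\theta(x) = \beta^2(p-1)|x|^p$, plugging this choice of $P$ into III gives

$$\frac{2(n+1)^2}{a\beta^2(p-1)n^2}\,\mathrm{III} = s^{-2p} + \frac{s^{2p}}{n^2}\sum_{1\le l,l'\le n}|u_lu_{l'}|^p - \frac2n\sum_{1\le l\le n}|u_l|^p = s^{-2p} + s^{2p}|\mathbf{u}|_p^{2p} - 2|\mathbf{u}|_p^p = 0,$$

with the above (optimal) choice of $s$.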

□ 
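
The word "optimal" in the proof above can be made concrete: the final expression is a perfect square in $s$. A sketch, assuming the normalized norm $|\mathbf{u}|_p = (n^{-1}\sum_{l\le n}|u_l|^p)^{1/p}$ (our reading of (2.11)) and $\mathbf{u} \ne 0$:

```latex
\begin{align*}
s^{-2p} + s^{2p}\,|\mathbf{u}|_p^{2p} - 2\,|\mathbf{u}|_p^{p}
 &= \bigl(s^{-p} - s^{p}\,|\mathbf{u}|_p^{p}\bigr)^2 \;\ge\; 0, \\
\text{with equality} &\iff s^{2p} = |\mathbf{u}|_p^{-p} \iff s^{2} = |\mathbf{u}|_p^{-1}.
\end{align*}
```

In particular, within this one-parameter family of matrices $P$ one always has $\mathrm{III} \ge 0$, and $s^2 = |\mathbf{u}|_p^{-1}$ is the only choice that achieves $\mathrm{III} = 0$.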

From now on assume that P is defined by (2.11). Let 

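$$Q = s^2\,\mathbf{u}\mathbf{u}^T = |\mathbf{u}|_p^{-1}\,\mathbf{u}\mathbf{u}^T \quad \text{and} \quad q = p_{n+1,n+1} = s^{-2} = |\mathbf{u}|_p. \tag{2.12}$$

Then the first line in (2.8) is equal to I + II where

$$\mathrm{I} = \frac{n}{n+1}\cdot\frac12\sum_{1\le l,l'\le n}\Bigl(\xi(u_{l,l'}) - u_{l,l'}\,\xi'(q_{l,l'}) + \Bigl(1-\frac{a}{n}\Bigr)\theta(q_{l,l'})\Bigr), \tag{2.13}$$

$$\mathrm{II} = \frac{n}{n+1}\cdot\frac12\bigl(\xi(1) - \xi'(q) + (1-a)\theta(q)\bigr). \tag{2.14}$$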



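Let us now look at the second line in (2.8). Let us take $\gamma$ such that $\gamma_{l,n+1} = 0$ for all $l \le n$
and let us rename $\gamma_{l,l'} = \lambda_{l,l'}$ for $1 \le l < l' \le n$. Then, obviously,

$$\sum_{1\le l<l'\le n+1}\gamma_{l,l'}w_{l,l'} = \sum_{1\le l<l'\le n}\lambda_{l,l'}u_{l,l'}. \tag{2.15}$$

Given a random vector $\mathbf{y}$ with distribution $N(0,\xi'(P))$ let $\mathbf{y} = (\mathbf{z}, z)$ so that $\mathbf{z}$ has distribution
$N(0,\xi'(Q))$ and $z$ has distribution $N(0,\xi'(q))$. With the above choice of $\gamma$ it should be
obvious that

$$\Phi_{n+1}(\mathbf{y},\gamma) = \Phi_n(\mathbf{z},\lambda) \times 2\,\mathrm{ch}(z+h)$$

and, therefore, by Hölder's inequality

$$\frac{n}{a}\log\mathbb{E}\,\Phi_{n+1}^{a/(n+1)}(\mathbf{y},\gamma) \le \frac{n}{n+1}\Bigl(\frac{n}{a}\log\mathbb{E}\,\Phi_n^{a/n}(\mathbf{z},\lambda) + \frac1a\log\mathbb{E}\bigl(2\,\mathrm{ch}(z+h)\bigr)^a\Bigr). \tag{2.16}$$

Combining this with (2.13), (2.14) and (2.15) proves that

$$\Psi(P,\gamma) \le \frac{n}{n+1}\,\psi(Q,\lambda) + \frac{n}{n+1}\,\mathrm{RS}(q) \tag{2.17}$$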
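
The Hölder step behind (2.16) can be written out as follows. This is a sketch using the conjugate exponents $(n+1)/n$ and $n+1$ and the factorization of $\Phi_{n+1}$ stated above, with $\mathbf{z}$ the first $n$ coordinates of $\mathbf{y}$ and $z$ the last one.

```latex
\begin{align*}
\mathbb{E}\,\Phi_{n+1}^{a/(n+1)}(\mathbf{y},\gamma)
 &= \mathbb{E}\Bigl[\Phi_n(\mathbf{z},\lambda)^{a/(n+1)}\,\bigl(2\,\mathrm{ch}(z+h)\bigr)^{a/(n+1)}\Bigr] \\
 &\le \Bigl(\mathbb{E}\,\Phi_n^{a/n}(\mathbf{z},\lambda)\Bigr)^{n/(n+1)}
      \Bigl(\mathbb{E}\bigl(2\,\mathrm{ch}(z+h)\bigr)^{a}\Bigr)^{1/(n+1)}, \\
\frac1a\log\mathbb{E}\bigl(2\,\mathrm{ch}(z+h)\bigr)^{a}
 &= \log 2 + \frac1a\log\mathbb{E}\,\mathrm{ch}^a(z+h).
\end{align*}
```

Taking logarithms and multiplying by $n/a$ gives (2.16). The last line is the Gaussian part of $\mathrm{RS}(q)$, which is how $\mathrm{RS}(q)$ enters (2.17). Equality in Hölder's inequality requires $\Phi_n^{a/n}$ and $(2\,\mathrm{ch})^a$ to be proportional almost surely.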



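and (2.10) implies

$$\inf_\lambda\psi(Q,\lambda) \le \sup_{Q'}\inf_\lambda\psi(Q',\lambda) \le \inf_{P'}\inf_\gamma\Psi(P',\gamma) \le \frac{n}{n+1}\inf_\lambda\psi(Q,\lambda) + \frac{n}{n+1}\,\mathrm{RS}(q).$$

Solving this inequality gives

$$\inf_\lambda\psi(Q,\lambda) \le n\,\mathrm{RS}(q) = n\,\mathrm{RS}(|\mathbf{u}|_p).$$

Plugging this back into (2.17) gives

$$\inf_\gamma\Psi(P,\gamma) \le n\,\mathrm{RS}(|\mathbf{u}|_p). \tag{2.18}$$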
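
The two algebraic steps "solving this inequality" and "plugging this back" amount to the following. A sketch with the shorthand $m = \inf_\lambda\psi(Q,\lambda)$ and $R = \mathrm{RS}(q)$ (our notation), assuming (2.17) in the form $\Psi \le \frac{n}{n+1}(\psi + \mathrm{RS}(q))$:

```latex
\begin{align*}
m \le \frac{n}{n+1}\,m + \frac{n}{n+1}\,R
 &\iff \frac{1}{n+1}\,m \le \frac{n}{n+1}\,R
 \iff m \le nR, \\
\inf_\gamma\Psi(P,\gamma) \le \frac{n}{n+1}\,m + \frac{n}{n+1}\,R
 &\le \frac{n}{n+1}\,nR + \frac{n}{n+1}\,R = nR .
\end{align*}
```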

Let us consider two alternatives - either all elements of $\mathbf{u}$ are equal in absolute value or not.
In the first case $|\mathbf{u}|_p = |u|$ and Lemma 2 implies that

$$\frac{1}{Na}\log\mathbb{E}Z_{n+1}(W)^{a/(n+1)} \le \mathrm{RS}(|\mathbf{u}|_p) = \mathrm{RS}(|u|),$$

which finishes the proof of Theorem 2 for $h = 0$ or $h \ne 0$ and $u \ge 0$.

Therefore, it remains to consider the cases when either not all elements of $\mathbf{u}$ are equal
in absolute value or $h \ne 0$ and $u < 0$. The following holds.

Lemma 4 If either not all elements of $\mathbf{u}$ are equal in absolute value or $h \ne 0$ and $u < 0$
then the inequality (2.16) is strict.

Remark. For simplicity of the proof, we will use the particular choices of $\xi(x) = \beta^2|x|^p$
and $P$ in (2.11). However, it should be easy to generalize the proof for general $\xi$. Lemma 3
is the only place where the specific form of $\xi$ was essential.

Proof. Hölder's inequality in (2.16) will be equality only if

$$\Phi_n(\mathbf{z},\lambda) = \mathrm{const}\cdot\mathrm{ch}(z+h)^n \tag{2.19}$$

almost surely. However, since $(\mathbf{z},z)$ have normal distribution with covariance $\xi'(P)$,

$$z_l = a_l z \quad \text{for} \quad a_l = \frac{\xi'(u_l)}{\xi'(s^{-2})}. \tag{2.20}$$
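
The linear relation (2.20) reflects that, for the pure $p$-spin $\xi$ and the rank-one matrix $P$ of Lemma 3, the matrix $\xi'(P)$ is again of rank one. A sketch, assuming $p_{l,l} = s^2u_l^2$, $p_{l,n+1} = u_l$ and $p_{n+1,n+1} = s^{-2}$ (our reading of (2.11)):

```latex
\begin{align*}
\xi'(x) &= p\beta^2|x|^{p-1}\operatorname{sgn}(x), \\
\operatorname{Var}(z_l)\operatorname{Var}(z)
 &= \xi'(s^2u_l^2)\,\xi'(s^{-2}) = p^2\beta^4|u_l|^{2(p-1)}
  = \xi'(u_l)^2 = \operatorname{Cov}(z_l,z)^2, \\
z_l &= \frac{\operatorname{Cov}(z_l,z)}{\operatorname{Var}(z)}\,z
     = \frac{\xi'(u_l)}{\xi'(s^{-2})}\,z \quad \text{almost surely.}
\end{align*}
```

Equality in the Cauchy-Schwarz inequality is what forces $z_l$ to be a deterministic multiple of $z$.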

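For $\varepsilon \in \{-1,+1\}^n$, let

$$f_1(\varepsilon) = \frac{1}{W_1}\exp\Bigl(h\sum_{1\le l\le n}\varepsilon_l + \sum_{1\le l<l'\le n}\lambda_{l,l'}\varepsilon_l\varepsilon_{l'}\Bigr), \qquad f_2(\varepsilon) = \frac{1}{W_2}\exp\Bigl(h\sum_{1\le l\le n}\varepsilon_l\Bigr)$$

be two probability functions on $\{-1,+1\}^n$, where $W_1$ and $W_2$ are corresponding normalizing
factors. Recalling the definition of $\Phi_n$ and using (2.20), (2.19) can be rewritten as

$$\sum_{\varepsilon}f_1(\varepsilon)\exp\Bigl(z\sum_{1\le l\le n}a_l\varepsilon_l\Bigr) = \sum_{\varepsilon}f_2(\varepsilon)\exp\Bigl(z\sum_{1\le l\le n}\varepsilon_l\Bigr) \tag{2.21}$$

almost surely for $z$ and, since both sides are continuous, for all $z \in \mathbb{R}$. Letting $z \to \infty$ implies
that $\sum_{l\le n}|a_l| = n$ and by (2.20),

$$\frac1n\sum_{1\le l\le n}|\xi'(u_l)| = \xi'(s^{-2}) = \xi'(|\mathbf{u}|_p).$$

Since $\xi'(x) = p\beta^2|x|^{p-1}\operatorname{sgn}(x)$, this is equivalent to $|\mathbf{u}|_{p-1} = |\mathbf{u}|_p$ which can happen only if
all elements of $\mathbf{u}$ are equal in absolute value. If not, this proves that the inequality (2.16) is
strict. If they are equal, then all $a_l = \pm1$ and it remains to consider the case $h \ne 0$ and $u < 0$.
Equation (2.21) means that moment generating functions of $\sum a_l\varepsilon_l$ under the law with p.f.
$f_1$ and of $\sum\varepsilon_l$ under the law with p.f. $f_2$ are equal and, therefore, their distributions are
equal. For example,

$$P_1\Bigl(\sum_{1\le l\le n}a_l\varepsilon_l = n\Bigr) = P_2\Bigl(\sum_{1\le l\le n}\varepsilon_l = n\Bigr).$$

If we denote $\mathbf{s} = (\operatorname{sgn}(a_1),\dots,\operatorname{sgn}(a_n))$ and $\mathbf{1} = (1,\dots,1)$ then the equality of these
probabilities is equivalent to $f_1(\mathbf{s}) = f_2(\mathbf{1})$. Similarly, replacing $n$ with $-n$ gives $f_1(-\mathbf{s}) = f_2(-\mathbf{1})$
and

$$f_1(\mathbf{s})/f_1(-\mathbf{s}) = f_2(\mathbf{1})/f_2(-\mathbf{1}).$$

This implies that $h\sum_{l\le n}\operatorname{sgn}(a_l) = hn$. If $h \ne 0$ then all $a_l = 1$ that contradicts that $u < 0$ for
which $a_l = \xi'(u)/\xi'(s^{-2}) < 0$.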

□ 

We are ready to finish the proof of Theorem 2. Under the conditions of Lemma 4, the
inequality in (2.17) will be strict for all $\lambda$. However, in the computation leading from (2.17)
to (2.18) we only really need to use this (strict) inequality for $\lambda_0$ such that

$$\psi(Q,\lambda_0) = \inf_\lambda\psi(Q,\lambda). \tag{2.22}$$

By convexity, such $\lambda_0$ exists and is unique. Then the same computation gives that

$$\inf_\gamma\Psi(P,\gamma) < n\,\mathrm{RS}(|\mathbf{u}|_p) \le nV(a) \tag{2.23}$$

and Lemma 2 finishes the proof of Theorem. One small technical issue that needs to be
mentioned is that if some $w_{l,l'} = \pm1$ then corresponding $\lambda_{l,l'} \to \pm\infty$ in the infimum (2.22).
However, this does not cause a problem, it simply forces the corresponding coordinates in
$\Phi_n$ to 'glue' together by forcing $\varepsilon_l = \pm\varepsilon_{l'}$, but the rest of the argument remains the same.

□ 